Adara Challenge Analysis

1. Exploratory Data Analysis

1.1 Data Dictionary

The data contains events from the months of January 2019 and January 2021; the 2019 can be used as a basis of comparison for the activity in 2021.

The destinations contained in the data are selected top-tier cities in Germany, UK, France, Italy, and Spain

The table below shows the definition of attributes:

Attributes Definition
event_date The date of the search or booking activity. This is different from the departure_checkin_date, which is the actual date of travel activity: if a customer makes a booking today for check-in on 1 October, today’s date would be the event date, and 1 October would be the “departure_checkin_date”
activity_group An indicator of the type of event on the hotel website. Classified as “Searching” (a general search for a room) or “Booking” (an actual reservation)
reason_for_travel_detailed this is the Reason for Travel, categorized as either “Business”, “Leisure-Non-Family” (leisure travel consisting of 1 or 2 people), and “Leisure-Family” (leisure travel consisting of 3 or more people)
origin_country_code 2-letter ISO country code indicating the origin country from which the booking or search activity was made
hotel_city,hotel_state,hotel_country raw information specifying the destination in question.
Note that this data is in an unclean state, provided straight from the source system.
traveler_value_group Indicator of traveler value (overall spend + loyalty status), grouped into “Low”, “Medium”, “High”
departure_checkin_date Start date of hotel stay; see _eventdate to determine how the date information differs.
total_number_events Aggregated count of total number of events

1.2 Data Exploration

Import Python packages and raw data

Summary Statistics

Data Wrangling

Event Year EDA

Event Month Day EDA

Event Weekday EDA

Activity Group EDA

Reason for Travel Detailed EDA

Origin Country Code EDA

Hotel City EDA

Hotel State EDA

Hotel Country EDA

Traveler Value Group EDA

Checkin Year EDA

Checkin Month EDA

Total Number of Events Histogram

1.3 Data Cleaning

Clean Hotel City

Merged the following cities:

Clean Hotel Country

Issues:

Final Clean Data

2. Insights Discovery

2.1 Booking

2.1 Searching

3. Actionable Recommendations